Recurrent Affine Transform Encoder for Image Representation

نویسندگان

چکیده

This paper proposes a Recurrent Affine Transform Encoder (RATE) that can be used for image representation learning. We propose learning architecture enables CNN encoder to learn the affine transform parameter of images. The proposed decomposes an matrix into two matrices and learns them jointly in self-supervised manner. RATE is trained by unlabeled data without any ground truth infers input images recurrently. inferred represent canonical form greatly reduce variations transforms such as rotation, scaling, translation. Different from spatial transformer network, does not need embedded other networks training with aid objectives. show achieves impressive results terms invariance translation, rotation. also classification performance enhanced more robust against distortion incorporating existing model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Affine transform resilient image fingerprinting

Affine transformations are a well-known robustness issue in many multimedia fingerprinting systems. Since it is quite easy with modem computers to apply affine transformations to audio, image and video content, there is an obvious necessity for affine transformation resilient fingerprinting. In this paper we present a new method for affine transformation resilient fingerprints that is based upo...

متن کامل

The finite ridgelet transform for image representation

The ridgelet transform was introduced as a sparse expansion for functions on continuous spaces that are smooth away from discontinuities along lines. We propose an orthonormal version of the ridgelet transform for discrete and finite-size images. Our construction uses the finite Radon transform (FRAT) as a building block. To overcome the periodization effect of a finite transform, we introduce ...

متن کامل

Image Representation Via a Finite Radon Transform

{ This paper presents a model of nite Radon transforms composed of Radon projections. The model generalizes to nite groups projections in the classical Radon transform theory. The Radon projector averages a function on a group over cosets of a subgroup. Reconstruction formulae formally similar to the convolved backprojection ones are derived and an iterative reconstruction technique is found to...

متن کامل

Astronomical image representation by the curvelet transform

We outline digital implementations of two newly developed multiscale representation systems, namely, the ridgelet and curvelet transforms. We apply these digital transforms to the problem of restoring an image from noisy data and compare our results with those obtained via well established methods based on the thresholding of wavelet coefficients. We show that the curvelet transform allows us a...

متن کامل

A New Affine Invariant Image Transform Based on Ridgelets

In this paper we present a new affine invariant image transform, based on ridgelets. The proposed transform is directly applicable to segmented image patches. The new method has some similarities with the previously proposed Multiscale Autoconvolution, but it will offer a more general framework and possibilities for variations. The obtained transform coefficients can be used in affine invariant...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3150340